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Abstract 

Bioinformatics is a relatively new interdisciplinary field that integrates computer science, ma¬ 
thematics, biology, and information technology to manage, analyze, and understand biologi¬ 
cal, biochemical and biophysical information. We present our experience in teaching an inter¬ 
disciplinary course, Introduction to Bioinformatics, which was developed and taught by an in¬ 
terdisciplinary team of computer science and biology faculty. To integrate the "Ethics, Compu¬ 
ting, and Genomics" component into the course curriculum, collaboration with an English fa¬ 
culty member was established. The course was designed as an upper-level elective for com¬ 
puter information systems and computer science majors and was open as a science elective 
for all science majors and for chemical engineering majors. We discuss the course curriculum, 
and a way to integrate the Python programming language in the UNIX environment will be 
presented. The integration of bioinformatics algorithms into the course curriculum will be em¬ 
phasized. 

Keywords: bioinformatics, pedagogy, interdisciplinary collaboration, computer science and 
information systems curriculum, Python, Biopython, ethics, genomics and computing 


1. INTRODUCTION AND 
MOTIVATION 

Bioinformatics is a relatively new interdiscip¬ 
linary field that integrates computer science, 
mathematics, biology, and information tech¬ 
nology to manage, analyze, and understand 
biological, biochemical and biophysical in¬ 
formation. According to a definition created 
by the National Institutes of Health 
( http://www.nih.qov/ j. bioinformatics is re¬ 


search, development, or/and application of 
computational tools and approaches for ex¬ 
panding the use of biological, medical, beha¬ 
vioral or health data, including those to ac¬ 
quire, store, organize, analyze, or visualize 
such data. Bioinformatics is a computational 
science and the subset of the larger field of 
computational biology. Computing and in¬ 
formation technologies have transformed 
research and business in the life sciences, 
facilitating new advances in molecular biolo- 
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gy and consequent new applications in bio¬ 
technology, pharmaceuticals, healthcare, the 
environment, the chemical industry and 
agriculture. Bioinformatics is first a biological 
science. However, bioinformatics has af¬ 
fected more than just biology - it has also 
had a profound impact on the computational 
sciences. Biology has rapidly become a large 
source of new algorithmic and statistical 
problems (Jones & Pevzner, 2004). The IS 
2002 Model Curriculum and Guidelines for 
Undergraduate Degree Programs in Informa¬ 
tion Systems (Gorgone, 2002) notes several 
characteristics of the IS profession that are 
integrated into the curriculum. One of these 
characteristics is "IS professionals must 
have strong analytical and critical thinking 
skills." In our opinion, introducing bioinfor¬ 
matics to CIS students will strengthen these 
required skills. It will also equip the students 
with some of the following capabilities as 
suggested in the IS 2002 guidelines: 

■ Creativity 

■ Application of both traditional and new 
concepts and skills 

• Application development 

■ Problem solving abilities 

■ Ability to communicate effectively (oral, 
written and listening) 

Integrating bioinformatics into the under¬ 
graduate CIS and CS curriculum provides 
opportunities for students to become familiar 
with one of the most widely used script lan¬ 
guages, Python, and to explore various data 
structures and algorithmic techniques tradi¬ 
tionally not covered in other courses. This 
helps students to make connections between 
theoretical topics learned in core CS and CIS 
courses, such as Data Structures and Algo¬ 
rithms, and to apply their knowledge to real 
world biology problems. Introducing bioin¬ 
formatics helps to diversify department 
course offering and provides interdisciplinary 
opportunities for CS and CIS students. In 
addition, CIS and CS students with a bioin¬ 
formatics background clearly will enhance 
their employment qualifications in the com¬ 
petitive job market (Cohen, 2004; Cohen, 
2005a; Cohen, 2005b). 

Bioinformatics has a clear interdisciplinary 
nature. In this paper, we report and discuss 
our experience and course results teaching 
an interdisciplinary course, Introduction to 


Bioinformatics. The first iteration of the 
course was offered in Spring 05. A revised 
version was offered again in Spring 08. We 
discuss the initial implementation, as well as 
revision and updates that were made, course 
results, and future plans. The course was 
taught by an interdisciplinary team of com¬ 
puter science and biology faculty. To inte¬ 
grate the "Ethics, Computing, and Genom¬ 
ics" component into the course curriculum, 
collaboration with an English faculty member 
was established. The course was designed as 
an upper-level elective for computer infor¬ 
mation systems (CIS) and computer science 
(CS) majors and was open as a science elec¬ 
tive for all science and chemical engineering 
majors. Since our department covers both 
curricula (Computer Science and Computer 
Information Systems), many courses are 
designed for both majors. This gives the op¬ 
portunity for CIS majors to partake of a wide 
range of elective courses offered by our de¬ 
partment. 

2. COURSE DESIGN, GOALS, 
OBJECTIVES AND CHALLENGES 

Recently, many universities and colleges 
have started to offer bioinformatics courses 
on different levels and to design bioinformat¬ 
ics minor and major curricula (LeBlanc & 
Dyer, 2003; Burhans & Skuse, 2004; LeB¬ 
lanc & Dyer, 2004; Morrow & Wilkins, 2004; 
Toth & Ronnelly, 2006; Tjaden, 2007; 
Zhang, Lin, Olsen, Beck, 2007; Goode & 
Trajkovski, 2007; Gosukonda & Naghedolfei- 
zi, 2007; Bruhn & Jennings, 2007; Khuri, 
2008). Some universities and colleges de¬ 
sign upper-level elective courses in compu¬ 
tational biology that focus on the design and 
analysis of algorithms with applications in 
molecular biology. Other departments de¬ 
sign upper-level elective courses in bioin¬ 
formatics that pair a biology major with a 
computer science major to learn from the 
challenges in the other discipline from peer- 
to-peer experience. In some situations, the 
course emphasizes the informatics issues 
and can be adapted to fit the interests and 
experience of a faculty member in computer 
science who has only a moderate amount of 
familiarity with molecular biology. While 
there are experiences to learn from, the 
area is still very young, and designing an 
introductory bioinformatics stand-alone 
course takes a lot of effort; the individual 
features of the department should be taken 
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into account, as well as faculty resources 
and available lab resources. The initial goal 
was to design an interdisciplinary course 
that would bring together CS and CIS stu¬ 
dents and undergraduate students from dif¬ 
ferent science majors and would provide an 
opportunity for interdisciplinary collaboration 
in the in-class laboratory assignments and 
team projects. Mainly, we targeted the 
course to biology students. We expected 
that Computer Science and Computer Infor¬ 
mation Systems students would have a solid 
background in programming and algorithms, 
while biology students would have solid biol¬ 
ogy background. Part of the course expe¬ 
rience would be the blending of student ex¬ 
pertise in the formation of teams to solve 
biological problems. With all this in mind, 
the Introduction to Bioinformatics course 
was designed and offered for the first time in 
Spring 2005 with an enrollment of 12 stu¬ 
dents, where 6 of the students were CIS and 
CS majors and 6 students were biology ma¬ 
jors. This balanced enrollment allowed us to 
create six interdisciplinary teams. Each team 
included one biology student and one CIS or 
CS student. Most of the coursework was 
done in teams. The course was taught by 
computer science and biology faculty. The 
course met in the lab 6 hours weekly. The 
development of this course was a challeng¬ 
ing task for several reasons. First, the di¬ 
verse background of the students had to be 
taken into account. Second, the program¬ 
ming language to be used in this course had 
to address the current and future needs of 
computational molecular biology, and at the 
same time had to be easy enough for the 
students to learn and to apply in bioinfor¬ 
matics problem solving. Third, careful 
thought had to be put into defining course 
prerequisites. Fourth, the goal was to design 
a well-balanced course from the content 
point of view, and to create a proper ratio 
between hands-on activities and lecture. 
Fifth, we experienced difficulties in finding 
one textbook that would answer all our 
needs. The following prerequisites for the 
course were determined for different majors: 
Biology/Chemistry/Biochemistry majors had 
as a prerequisite Introduction to Molecular 
Biology, CS/CIS/MATH majors had prerequi¬ 
site Introduction to Computer Science I, and 
Chemical Engineering Majors had as a pre¬ 
requisite Computer Programming and Engi¬ 
neering Problem Solving. After thoughtful 
consideration, Python was chosen to be a 


programming language for the course. There 
were several reasons for choosing this lan¬ 
guage for the bioinformatics course: simplic¬ 
ity, power, and object-oriented capabilities. 
Also, some biology students can take an In¬ 
troduction to Programming with Python for 
non-majors offered by our department prior 
to the bioinformatics course and it would 
provide programming foundation knowledge 
for biology students who need it. In addition, 
most CS and CIS students who would take 
the bioinformatics course would not have 
had any prior knowledge of Python, and the 
bioinformatics course would provide an op¬ 
portunity to learn Python and enrich their 
undergraduate curriculum; recently, the Py¬ 
thon language has started to be used more 
often in bioinformatics as a general-purpose 
programming language (Mangalam, 2002). 
We were also able to introduce students to 
the Biopython Project, an international asso¬ 
ciation of developers of freely available Py¬ 
thon tools for computational molecular biol¬ 
ogy (http//biopython.org). 

It is critical for students to understand bio¬ 
logical problems as well as computational 
solutions in order to produce useful bioin¬ 
formatics tools. Since bioinformatics is a 
computational science, it is important to in¬ 
troduce students to the principles that drive 
an algorithm's design. One of the important 
goals of the course was to introduce stu¬ 
dents to the intellectual content of bioinfor¬ 
matics. To achieve this goal, it was decided 
to integrate bioinformatics algorithms into 
the course and to teach the foundations of 
the algorithms and important theoretical 
results in bioinformatics along with end-user 
bioinformatics tools (Jones & Pevzner, 
2004). For the first iteration, the biology 
faculty member taught 4 hours per week 
and the computer science faculty member 2 
hours per week and the idea was to blend 
biology and computer science topics togeth¬ 
er. However, while keeping the main goal to 
blend computer science and biology topics, 
we realized that there was more computer 
science material to cover, and the ratio was 
thus changed to 2 hours per week for biolo¬ 
gy topics and 4 hours per week for computer 
science topics for the second iteration of the 
course in Spring 08. Additionally, the course 
curriculum was updated and revised based 
on the initial experience and the enrollment 
changes were taken in account. We were 
prepared for the reality that only CIS and CS 


© 2010 EDSIG 


http://isedj.org/8/27/ 


June 14, 2010 



ISEDJ 8 (27) 


Kortsarts, Morris, and Utell 


6 


students were enrolled in the course in 
Spring 08. This gave us an opportunity to 
expand the programming/algorithms com¬ 
ponent of the course, to have a more com¬ 
plex final project and also to integrate a few 
additional topics. We will discuss the curricu¬ 
lum issues in detail in the following section. 

3. COURSE CURRICULUM 

The course started with an introductory lec¬ 
ture that provided several definitions of the 
term "bioinformatics" to give students an 
idea of what the course was about. We also 
explained the structure of the course, the 
tentative list of topics that would be cov¬ 
ered, and the level of the theoretical and 
technical content. We also emphasized the 
interdisciplinary nature of the topic and of 
the course. The first topic covered in the 
course was "Ethics, Computing and Genom¬ 
ics". This was a project-oriented component 
and was a new addition integrated into the 
Spring 08 curriculum. The idea of this 
project was to give students an opportunity 
to learn about the ethics, computing and 
genomics topic independently and to present 
the results of the self-learning. To learn the 
topic, students were assigned one or more 
scholarly articles from the collection Ethics, 
Computing, and Genomics, edited by Her¬ 
man Tavani. The collection is divided into 
five different sections, with introductions and 
lists of questions prior to each section. Stu¬ 
dents were required to read assigned es¬ 
says; to prepare 25-minute Power Point 
presentations that would include a summary 
of the paper and answers to the questions 
posed in the introductory part of the corres¬ 
ponding section; and to prepare a mini-quiz 
to assess the understanding of the pre¬ 
sented material by their peers. The goal of 
this component was to develop oral and 
written communication skills and to engage 
students in the knowledge exchange 
process. To accomplish these goals, an in¬ 
terdisciplinary collaboration with an English 
faculty member was established. The team 
of computer science and biology faculty met 
with the English professor and discussed the 
goals, structure and requirements of the 
Ethics, Computing and Genomics compo¬ 
nent, and decided that in order to improve 
students' performance, the English faculty 
member would do a short presentation be¬ 
fore students started to work on this as¬ 
signment. The presentation covered the fol¬ 


lowing issues: (a) discussion of how to read 
critically and what questions to ask while 
reading the text; (b) discussion of how to 
summarize the paper using the structure of 
the essay as a guide and elucidating key 
points and key moments of evidence while 
making connections to the rest of the class 
material; (c) tips on writing the summary 
that include three steps: prewriting, draft¬ 
ing, revising; and (d) discussion of how to 
design an effective presentation of informa¬ 
tion. The English faculty member was 
present at all oral presentations and pro¬ 
vided detailed notes for each student ex¬ 
plaining ways the presentation could have 
been stronger and also pointing out the posi¬ 
tive and negative aspects of the presenta¬ 
tion. These valuable comments allowed stu¬ 
dents to improve their performance and 
their oral and written communication skills, 
and to complete the course with a well- 
designed presentation for the final course 
project that will be discussed later in the 
paper. The collaboration with the English 
faculty member provided an opportunity to 
accomplish the goals related to the devel¬ 
opment of oral and written communication 
skills. This successful and enjoyable expe¬ 
rience showed the value of working with col¬ 
leagues across disciplines to further student 
learning. The reading and research paper 
presentation assignment contributed 20% to 
the final course grade and students were 
given three weeks to work on this project. 
The rest of the course was taught by com¬ 
puter science and biology faculty. The list of 
the topics with explanations about the spe¬ 
cific activities and ideas for resources is giv¬ 
en below. The goal was to blend biology and 
computer science topics together to create a 
real-world atmosphere in the course where 
students could observe how the same prob¬ 
lem was tackled from multiple perspectives. 

a. Introduction to Python 

One goal was to acquaint students with in¬ 
troductory Python very quickly during the 
first few weeks of the course, which would 
allow for working on different problem solv¬ 
ing algorithmic techniques. Advanced Python 
topics were taught later throughout the 
course, building students' knowledge and 
their abilities to tackle biology real-world 
problems. The programming examples were 
all biology-oriented and motivated students 
to learn in order to solve practical problems. 
During the first few weeks of the course, we 
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introduced introductory topics such as Py¬ 
thon arithmetic, decision and loop structures 
and functions, and as well simple manipula¬ 
tions with strings, lists, tuples and dictiona¬ 
ries. In Spring 05, we had six biology stu¬ 
dents without any prior programming expe¬ 
rience and six CS and CIS students without 
any prior experience in Python. In order to 
make this part of the course successful, the 
class was divided into six interdisciplinary 
teams and all concepts were practiced within 
the team with the help of CS and CIS ma¬ 
jors. In Spring 08, all students enrolled in 
the course were CS and CIS majors with 
prior programming experience in C and Java, 
and some with introductory knowledge in 
Python, as well. Special handouts were pre¬ 
pared to walk students through the introduc¬ 
tory topics toward advanced Python con¬ 
cepts. Each topic was supported by a list of 
examples in increasing order of complexity. 
Students were required to run the proposed 
programs in order to gain understanding of 
basic Python structures. To assess the un¬ 
derstanding of each concept, students were 
required to write short programs solving bi¬ 
ology-oriented problems. A few examples of 
the problems, given here in increasing level 
of complexity, include computation of the 
alignment score between two DNA se¬ 
quences using different score matrices; find¬ 
ing the maximal alignment score if no inter¬ 
nal gaps are allowed using different score 
matrices; finding all occurrences of one se¬ 
quence in another sequence; writing a pro¬ 
gram that reads a DNA sequence, first tran¬ 
scribing DNA into RNA and printing the re¬ 
sulting RNA sequence, then translating RNA 
into a protein sequence through the follow¬ 
ing: first, the program divides RNA into co¬ 
dons and prints the list of codons, and 
second, the codons are translated into the 
protein using genetic code table and finding 
the maximal alignment score if internal gaps 
are allowed using different score matrices. 
In Spring 08 we had students with different 
levels of programming and computational 
experience and the best way to cover this 
topic was through independent learning. 
Students used provided handouts and Py¬ 
thon and BioPython tutorials 
( www.python.org , www.biopvthon.orq j and 
worked each at their own pace. We provided 
grading rubrics for each programming con¬ 
cept, specified minimal requirements to pass 
the specific concept, and provided a list of 
more advanced examples for students with 


prior Python experience. This approach al¬ 
lowed students with previous Python know¬ 
ledge to further advance their experience 
and students new to Python to learn the new 
programming language independently using 
structured guidance. Python also provides 
an opportunity to solve some problems in 
very short ways, and it was a very enjoyable 
experience for students to try to find a 
shortest solution for the proposed problems 
using Python functions and libraries. 

b. Introduction to Bioinformatics 
Algorithms 

To introduce students to the principles that 
drive an algorithm's design and to the intel¬ 
lectual content of bioinformatics, the bioin¬ 
formatics algorithms component was inte¬ 
grated into the course (Jones & Pevzner, 
2004). Students were introduced to the top¬ 
ics of sequence alignments, scoring matric¬ 
es, and gaps. We covered exhaustive search 
and dynamic programming algorithm design 
techniques. To support the dynamic pro¬ 
gramming techniques with specific exam¬ 
ples, the Needleman and Wunsch algorithm 
(Jones & Pevzner, 2004; Krane and Raymer, 
2002) for finding an optimal global align¬ 
ment and the Smith-Waterman algorithm 
(Jones & Pevzner, 2004; Krane and Raymer, 
2002) for finding an optimal local alignment 
were introduced. We also discussed the con¬ 
cept of semi-global alignment and the slight 
modification of the Needleman and Wunsch 
algorithm for this purpose. Dynamic Pro¬ 
gramming technique usually is not covered 
in a core algorithms course, and including 
this topic in our course curriculum provided 
an opportunity to expand the theoretical 
background and to make connections be¬ 
tween theory and practice. It also helped to 
maintain an appropriate level of theoretical 
content required for upper-level elective 
courses in our department. Additionally, this 
topic was very well blended with biology top¬ 
ics and students had an opportunity to learn 
the concept of sequence alignments from 
biology and computer science points of view. 
It is a challenging topic for the instructor to 
teach and for the students to learn. For the 
first iteration of the course, the topic was 
studied in interdisciplinary teams, similar to 
the idea that was employed to teach Python. 
To enhance students' understanding of the 
theoretical aspects, the visualization of bio¬ 
informatics algorithms through the ALGGEN 
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EMBER website was used 
( http://alqqen.lsi.upc.es/docencia/ember/fra 
me-ember.html ). This website provides a 
suite of multimedia bioinformatics educa¬ 
tional tools and allows us to create a set of 
hands-on activities to help students to gain 
understanding of the dynamic programming 
technique in general and specific algorithms 
in particular. In this part of the course we 
also introduced the BLAST algorithm and the 
concept of multiple sequence alignment, and 
provided students with a brief theoretical 
background. We mainly focused on practical 
applications of these tools to solve biological 
problems. BLAST was the main tool that 
students used to complete a final course 
project. The multiple sequence alignment 
tool CLUSTALW was used in some of the 
hands-on activities of the biology part of the 
course, which are described below. 

c. Biological Research on the Web 

In this part of the course, which focused on 
both biology and computer science, students 
got a familiarity with public biological data¬ 
bases and data formats, and learned how to 
search biological databases to find required 
information (Gibas & Jambeck, 2001). The 
main website that students worked with in 
this course was NCBI - National Center for 
Biotechnology Information 

( http://www.ncbi.nlm.nih.gov/ ) - the site 
that was established in 1988 as a national 
resource for molecular biology. 

d. Biology Topics 

The biology part of the course started with a 
review of molecular biology and biochemi¬ 
stry concepts that covered DNA and protein 
structure, gene expression (transcription and 
translation), and molecular biology central 
dogma. Next, the concept of sequence 
alignment was introduced and discussed 
from different perspectives (Krane & Ray- 
mer, 2002). Students learned to use se¬ 
quence alignment to understand relatedness 
among species and to use sequence align¬ 
ment forensically. The hands-on activities to 
support this part of the course included the 
exploration HIV evolution lab developed by 
Sam Donovan and Anton E. Weisstein from 
Microbes Count!, which is a collection of 
multimedia resources, simulations, and tools 
that offer an interactive, open-ended envi¬ 
ronment for learning about microbiology 
(Jungck & Stanley, 2003). Human Immu¬ 


nodeficiency Virus (HIV), like other retrovi¬ 
ruses, has a much higher mutation rate than 
is typically found in organisms that do not 
go through reverse transcription (the copy¬ 
ing of RNA into DNA). The HIV genome is 
very small and relatively simple. It is made 
up of nine genes and about 9,500 nucleo¬ 
tides. In this lab students worked with HIV 
sequence data collected from 15 individuals 
from an intravenous-drug-using population 
in Baltimore. The goal of the study was to 
determine if the HIV isolated from particular 
subgroups of subjects derives from a com¬ 
mon source. In order to approach this ques¬ 
tion students needed to characterize the 
strains of HIV within an individual and quan¬ 
tify the differences between individuals (Do¬ 
novan & Weisstein, 2003). To accomplish 
the goals of the lab, students used Biology 
Workbench ( http://workbench.sdsc.edu ), a 
free, web-accessible suite of resources for 
working with molecular sequence and struc¬ 
ture data. Biology Workbench provides a 
unified interface to access a variety of tools 
and databases; it is online and can be ac¬ 
cessed with a standard web browser and it is 
available at no cost to academic users. The 
Biology Workbench was developed by Shan¬ 
kar Subramaniam and currently resides on a 
supercomputer at the San Diego Supercom¬ 
puting Center. Thus, it provides access to 
both extensive computational power and 
huge data storage capacity. To complete the 
project the CLUSTALW multiple sequence 
alignment tool was used. In this part of the 
course, students also learned about microar¬ 
rays. The microarray lab was developed by 
Campbell and Heyer and is sold by Carolina 
Biologicals and called DNA Chips: Genes to 
Disease. An interesting lab activity was de¬ 
signed to help students to understand how 
microarrays are used to identify gene 
changes in disease. This simulation provided 
an opportunity to introduce students to mi¬ 
croarrays, the complexities of gene expres¬ 
sion, and the role of gene expression in can¬ 
cer. Using simulated microarray technology, 
students compared the relative expression 
levels of six different genes in healthy lung 
cells and lung cancer cells. After completing 
the lab, students had an opportunity to dis¬ 
cuss the significance of the relative expres¬ 
sion levels with respect to the genes' roles in 
causing cancer (Heyer & Campbell). In addi¬ 
tion, the biology part of the course provided 
an opportunity to use computer simulations 
to test hypotheses about disease spread. 
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Students learned the concept of epidemiolo¬ 
gy, the study of the distribution of diseases 
in populations and explored factors that in¬ 
fluence disease spread throughout popula¬ 
tions with the software Epidemiology. Ebola 
was used as a model organism and epidemi¬ 
ology was presented from both a microbio¬ 
logical and social perspective (Jungck & 
Stanley, 2003). Also, students used a vast 
computer database of genetic information to 
explore the structure and function of insulin 
and to generate a phylogenetic tree demon¬ 
strating evolution of insulin amongst the 
vertebrates (Campbell & Reece, 2004). 

4. COURSE PROJECT 

In the final course project students were 
engaged in the process of DNA sequence 
annotation. Students worked with the real 
data. The microbial genome resources were 
used for these purposes and Bacillus anthra- 
cis str. Ames project at J. Craig Venter Insti¬ 
tute was used as a specific organism 
(http://www.ncbi.nlm.nih.gov/). The input 
DNA sequence was about 50,000 nucleotides 
long, and students worked on different such 
sequences from the same organism. The 
project consisted of several steps. First, stu¬ 
dents were asked to find a list of all potential 
genes and pseudo-genes in the input DNA 
sequence, using the information about start 
and end codons, and to arrange the found 
sequences in two separate lists, one for po¬ 
tential genes (length of the sequence is 
larger than 300) and one for pseudo-genes 
(length of the sequence is less than 300), in 
order of increasing length. Second, students 
located the potential promoters in the given 
DNA sequences for each potential gene that 
they found in the first step, and calculated 
the strength of the promoter. A promoter is 
a region of DNA near the beginning of a 
gene that controls if and when the gene is 
actually expressed. As an output for this 
step, students were asked to list potential 
genes in order of decreasing promoter 
strength. 

To complete the project students were asked 
to BLAST all potential genes and pseudo¬ 
genes that were found, and to perform an 
analysis of the results. Students were re¬ 
quired to summarize the results in a table 
and provide the following information for 
each potential and pseudo-gene that they 
found: start position, length, promoter 


score, BLAST results, summary and conclu¬ 
sion. For each sequence, we asked students 
to determine whether a potential gene could 
be a real gene based on the strength of the 
promoter and BLAST results. Students were 
asked to present project results in 15- 
minute in-class presentations that included 
the presentation of the Python program, in¬ 
cluding the description of all Python func¬ 
tions that were used and the purpose of 
each function, as well as all algorithms 
or/and programming techniques that were 
used and the presentation and explanation 
of the summary of the project results, in¬ 
cluding the information about the specific 
organism whose DNA was used as the input. 
In the first iteration of the course, this was a 
team project. Each team consisted of a 
computer science and a biology student. For 
the first iteration, the programming part of 
the project was mostly done by the comput¬ 
er science students, and the biology stu¬ 
dents were required to understand and to 
explain the programming techniques and 
algorithms that were used. The project pro¬ 
vided a possibility for truly interdisciplinary 
collaboration between computer science and 
biology students. In the second iteration of 
the course, in Spring 08, the students 
worked on the project individually. For fu¬ 
ture iterations of the course we are planning 
to return to team work in the project in or¬ 
der to enhance the collaborative component 
of the course. 

5. COURSE RESULTS 

For the first iteration of the course no formal 
assessment survey was conducted. An in¬ 
formal discussion about the course was con¬ 
ducted at the end of the semester and we 
asked students to provide their feedback. 
Also, as part of our regular departmental 
procedure, students completed teaching 
evaluations and provided their comments 
there as well. All students showed satisfac¬ 
tion with the course and we were very 
pleased to receive the request to extend the 
programming component of the course from 
almost all students. Biology students showed 
interest in programming and asked that an 
environment be created where they would 
be able to more fully participate in all stages 
of the course project. In the second iteration 
of the course, we designed a short post¬ 
survey in order to assess the students' expe¬ 
rience which included a list of the topics that 
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were covered in the course. We asked stu¬ 
dents to rate the level of learning for each 
topic on a scale of 1 (not well) to 5 (very 
well). Six students were enrolled in the 
Spring 08 and the table with the list of the 
topics and the average ratings is given in the 
Appendix. 

We also asked students to comment on the 
Ethics, Computing and Genomics component 
and received positive feedback from most of 
the students. In addition, we asked students 
to provide any comments regarding the 
course, and most of the students mentioned 
that they loved the course and would rec¬ 
ommend it to their peers; they expressed 
their satisfaction with the level of the course 
and the amount of material covered and the 
depth of the coverage. They also mentioned 
that the final project was very interesting 
but at the same time they proposed that we 
be more careful with the project description 
and to provide clear rules for finding genes 
on the main and complement strings to 
avoid confusion. 

6. LESSONS LEARNED AND FUTURE 
PLANS 

Based on the students' post-survey results, 
informal discussions, and comments from 
teaching evaluations, we could state that the 
integration of an Introduction to Bioinfor¬ 
matics course into the CS/CIS curriculum 
was successful. For future iterations of the 
course we are planning to follow the existing 
structure in general, but based on our expe¬ 
rience and taking into account some of the 
student's suggestions, several changes will 
be introduced. As shown in the assessment 
table in Appendix, all topics that were cov¬ 
ered in the second iteration of the course 
were learned on an above average level, but 
it is easy to see that some of the topics re¬ 
quired our special attention and should be 
revised for future iterations. The teaching 
approach will try to foster student learning 
through a research-based process. We will 
try to blend biology topics with computer 
science topics throughout the course more 
effectively, to make sure that students have 
a clear motivation for each CS/CIS task, and 
to make sure that each topic receives cover¬ 
age from the two different perspectives. In 
addition, we think that a guest speaker from 
the field could enrich the course curriculum 
and we will try to arrange the talk or per¬ 


haps even a visit to a bioinformatics lab fa¬ 
cility. We will continue to emphasize the in¬ 
terdisciplinary nature of the course and will 
try to design more collaborative teamwork 
even if the course enrollment consists of 
CS/CIS majors only. We will continue the 
integration of hands-on activities to make 
the course a fun and enjoyable experience. 
We will definitely continue the integration of 
the Ethics, Genomics and Computing com¬ 
ponent, but we would try to find different set 
of papers to stay current in the field and also 
to cover a wide range of aspects related to 
the topic. We will work on improving the 
post-survey by asking more specific ques¬ 
tions. We will also try to attract more stu¬ 
dents to the course which will allow a larger 
sample for the course assessment. A special 
word should be said about the textbook. It 
was not an easy task to find one textbook 
that would answer all our needs since our 
course has several unique features; when 
combined together they didn't allow for an 
easy solution to the problem of textbook 
choice. For both initial iterations of the 
course we used several textbooks (see Ref¬ 
erences section for the complete list of all 
textbooks that were used for the course). 
For succeeding iterations of the course we 
are planning to design our own custom text 
for the course using several textbooks and 
our own lecture notes with the help of a pro¬ 
fessional publisher. While the two initial im¬ 
plementations of the course were successful, 
all the above-mentioned ideas would defi¬ 
nitely help to improve the course and we are 
looking forward to the future iterations. 
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APPENDIX 

Table 1: The results of the students post survey - list of topics with the 


average ratings for level of learning 

Number of Students that Answered is 6 


Topic 

Score 

1. Introductory Python and ability to design simple Python programs 

3.7 

2. Advanced Python topics: functions, loops, if-else statements, string manipu¬ 
lations, lists, and list manipulations 

3.5 

3. Designing complex Python programs using advanced Python features 

3.3 

4. Understanding the concept of sequence alignment: global, local, semi-global, 
multiple sequence alignment 

3.2 

5. Understanding dynamic programming algorithmic technique 

3.7 

6. Understanding Exhaustive Search (brute force) algorithmic technique 

4 

7. Understanding Needleman-Wunsch algorithm and be able to trace the algo¬ 
rithm to produce the final result 

3.8 

8. Understanding Smith-Waterman algorithm and be able to trace the algorithm 
to produce the final result 

3.8 

9. The ability to work independently on the research - based project applying 
computer science and biology knowledge to solve problems 

4.3 

10. Understanding how to use BLAST tool and to read the results of BLAST 

4.2 

11. Using sequence alignments to understand relatedness among species 

3.8 

12. Using sequence alignments forensically (HIV experiment) 

4.2 

13. Understanding how microarrays are used to identify gene changes in dis¬ 
ease 

3.3 

14. Understanding the flow of information from DNA to protein 

3.3 

15. Using computer simulations to test hypotheses about disease spread 

3.8 

16. The ability to read a research paper in the Ethics, Computing and Genomics 

3.8 

17. The ability to communicate effectively through the participation in the Eth¬ 
ics, Computing and Genomics project 

4 

18. The ability to create an informative power point presentation to present the 
results of the Ethics, Computing and Genomics project 

4.3 

19. The ability to learn the topic by yourself and the ability to present results of 
learning in clear way 

3.8 
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